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The factor complexity of the infinite word canonically associated to a non-simple Parry 
number /3 is studied. Our approach is based on the notion of special factors introduced by 
Berstel and Cassaigne. At first, we give a handy method for determining infinite left special 
branches; this method is applicable to a broad class of infinite words which are fixed points 
of a primitive substitution. In the second part of the article, we focus on infinite words 
only. To complete the description of its special factors, we define and study (a, 6)-maximal 
left special factors. This enables us to characterize non-simple Parry numbers /? for which the 
word has affine complexity. 

1 Introduction 

The aim of this work is to compute the factor complexity function C(n) of the infinite word 
associated with /3-expansions ^25], where /3 is a non-simple Parry number. The definition 
of Parry numbers is connected with the Renyi expansion of unity Parry numbers are 

those (3 for which d/^{l) is eventually periodic. Positional numerical systems with a Parry 
number as a base have a nice behavior. For example, if we consider /3-integers, i.e., real 
numbers with vanishing /3-fractional part in their /3-expansion, then the distances between 
two consecutive /3-integers take only finitely many values. In fact, this property can be used as 
an equivalent definition of Parry numbers. In this sense, positional numeration systems based 
on Parry numbers are a natural generalization of the classical decimal or binary systems. Let 
us mention that even the innocent looking rational base /? = § brings into numeration systems 
phenomena never observed before |1J . 

The most prominent Parry number is the golden mean r = with ^,-(1) = 11. The 

infinite word associated to r is the famous Fibonacci chain, i.e., the word generated by the 
substitution i-^ 01 and 1 i-^ 0. The Fibonacci chain codes the distances between r-integers. 
Fabre in [12] showed that for any Parry number there exists a canonical substitution over a 
finite alphabet such that its unique fixed point up represents the distribution of /3-integers on 
the real line. 

/3-integers attracted attention of physicists after the discovery of quasicrystals in 1982 |25j . 
r-integers were shown to be a suitable tool for describing atomic positions in solid materials 
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with long range order and non-crystalographical five-fold symmetry [20], [3]. The knowledge 
of the factor complexity of the Fibonacci chain is the first step towards the description of 
variability of local configurations in quasicrystals [19] . 

Parry numbers are split into two groups: a Parry number /? is called simple if the Renyi 
expansion of unity dp{l) has only a finite number of nonzero elements, otherwise f3 is non- 
simple. The questions concerning the factor complexity of words associated with simple 
Parry numbers were discussed in [15] and [1]. Of course, since among one can find some 
Sturmian sequences and Arnoux-Rauzy words, the complexity of for some specific values 
of f3 were known earlier. 

The first non-simple Parry number /? for which the factor complexity of u/j was precisely 
determined is such that fi^(l) = 2(01)'^, i.e., /3 is a root of the polynomial — Ix^ — x -|- 1. 
This non-simple Parry number appears naturally when describing the model of quasicrystals 
with seven-fold symmetry [2] . The first attempt to study factor complexity of for broader 
class of non-simple Parry number can be found in . 

Since any infinite word is the fixed point of a primitive substitution, the factor com- 
plexity of can be estimate from above by a linear function, see [23]. Moreover, we know 
that the first difference of complexity is bounded by a constant [8] [21]. Nevertheless, in 
general, it is hard to find an explicit formula for the complexity function of an infinite word 
u and it seems it holds also for the case of u^. However, we are able to find all left special 
factors that, in a certain sense, completely determine the factor complexity. The notion of 
(right) special factor was introduced by Berstel [5] in 1980 and considerably enhanced by 
Cassaigne in his paper [9] in 1997. We introduce another slight enhancement, a tool that will 
help us to identify all infinite left special branches of fixed point of substitutions satisfying 
some natural assumption. Further, the knowledge of the structure of left special factors will 
allow us to identify all non- simple Parry numbers (3 for which the complexity of is affine: 
The complexity of is affine if and only if ti(0 • • • 0(ti — l))"^ (Theorem 57). 



2 Parry numbers and associated infinite words 

For each x £ [0, 1) and for each /? > 1, using a greedy algorithm, one can obtain the unique 
(5-expansion {xi)i>i,Xi £ N, of the number x such that 

x = Y^ Xip-' and ^^(^'' < P'^'^^ ■ 

i>l i>k 

By shifting, each non-negative number has a /^-expansion. For x S [0, 1), the /3-expansion can 
be computed also by using the piecewise linear map : [0, 1) — > [0, 1) defined as 

T^{x) = {(3x}, 

where {(3x} is the fractional part of the real number The sequence djs^x) = X1X2X3 • • • is 
obtained by iterating with 

xi=iprf\x)\. 
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The difference between /3-expansion and djj^x) arises for x = 1 since the Renyi expansion of 
unity d/3(l) is not a /3-expansion. Parry [22] showed that plays a very important role in 

the theory of /3- numeration. Among other things, it allows us to define Parry numbers. 

Definition 1. A real number P > 1 is said to be a Parry number if dp{l) is eventually 
periodic. In particular, 

a) if dp^l) = ti ■ ■ ■ tm is finite, i.e., it ends in infinitely many zeros, then j3 is a simple 
Parry number, 

b) if it is not finite, i.e., dp{l) = ti - ■ ■ tm{tm+i ■ • • tm+p)^ , then f3 is called a non-simple 
Parry number. 

Note, that the parameters m,p > are taken the least possible. It implies that t^ ^ tm+p 
which will be a very important fact. Another crucial property of is the following Parry 

condition [22] valid for all /? > 1 

tjtj+itj+2 ■■■ -< ht2t3 ■■■ for every j > 1 , (1) 

where -< is the (strict) lexicographical ordering. 

As the infinite word is tightly connected with a geometrical interpretation of /3-integers, 
we first introduce /3-integers along with some of their properties. 

Definition 2. The real number x is a /3-integer if the [i-expansion of \x\ is of the form 
Z]i=o "^j/^' - The set of all (3-integers is denoted by Z^. 

The definition of /3-integers coincides with the definition of classical integers in the case of 
(5 in Z. But there are several new phenomena linked with the notion of /^-integers when (3 is 
not an integer. For our purposes, the most interesting difference between classical integers and 
/3-integers is the difference in their distribution on the real line. While the classical integers 
are distributed equidistantly, i.e., gaps between two consequent integers are always of the 
same length 1, the lengths of gaps between /?- integers can take their values even in an infinite 
set. More precisely, Thurston [26j proved the following theorem. 

Theorem 3. Let (5 > 1 be a real number and dfs{\) = (tj)j>i. Then the length of gaps between 
neighbors in takes values in the set {Aq, Ai, . . .}, where 

fc>i '^'^ 

Corollary 4. The set of lengths of gaps between two consecutive ^-integers is finite if and 
only if (3 is a Parry number. Moreover, if [3 is a simple Parry number, i.e., dp{\) = 
ti - ■ ■ tm, the set reads {Aq, Ai, . . . Am_i}, if 13 is a non-simple Parry number, i.e., d/3(l) = 
■ tm+pY, we obtain {Aq, Ai, . . . A^+p-i}. 
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Now, let us suppose that we have drawn /3-integers on the real line and assume that 
/3 is a Parry number. If we read the length of gaps from zero to the right, we obtain an 
infinite sequence, say {Ai^}k>o- Further, if we read only indices, we obtain an infinite word 
over the alphabet {0, . . . , m — 1} in the case of simple Parry numbers, and over the alphabet 
{0, . . . ,m + p — 1} in the non-simple case. The obtained infinite word is just the word we 
are interested in. However, there exists another way to define it. Fabre [12] proved that 
can be defined as the unique fixed point of a substitution (/?^ canonically associated with a 
Parry number /3 and defined as follows. 

Definition 5. For a simple Parry number (5 the canonical substitution ip^ over the alphabet 
^ = {0,l,...,m — 1} is defined by 

V^/3(0) = 0*11 

(pp{m — 2) = 0*™-i(m— 1) 
(/?/3(m-l) = 0*™ 

Definition 6. For a non-simple Parry number (3 the canonical substitution ipp over the al- 
phabet ^ = {0, 1, . . . , m + ]3 — 1} is defined by 

= 0*^2 

(Pf3{m—1) = 0*™m 
ifp{m) = 0*™+i(m-|-l) 

ipi3{m+p — 2) = 0*'"+p-i(m-|-p— 1) 
ipf3{m+p—l) = 0^^+Pm. 

We see that the definition of is given by (i/3(l) and that the only difference between 
simple and non-simple cases lies in the images of the last letters m — 1 and m-\- p — 1 . While 
in the simple case the last letters of images ipjs^k), /c = 0, 1, . . . , m — 1, are all distinct and so 
the images form a suffix-free code, in the non-simple case either f 13(171) = 0*™m is a prefix of 
^Ppijn + p — 1) = 0*"^+pm or vice versa. As we will see later on, this property is crucial from 
the point of view of computing the complexity of the infinite word u^. 

Definition 7. Let (3 > 1 be a Parry number. The unique fixed point of the canonical substi- 
tution ipp is denoted by 

Uf,= lim (^^(0)=(^^(0). 

The uniqueness of follows from the definitions of (pp, the letter is the only admissible 
starting letter of a fixed point. 
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3 Special factors and factor complexity 



In this section, we will recall the notion of special factors of an arbitrary infinite word and 
we will explain how the structure of special factors of an infinite word determines its factor 
complexity. To be able to do it, we need some usual basic notation, see [9^ for more. 

Definition 8. Let A = {0, 1, . . . ,q — 1}, q > 1 be a finite alphabet. An infinite word over the 
alphabet A is a sequence u = {ui)i>i where Ui G A for all i > 1. If v = UjUj+i ■ ■ -Uj+n-i, 
j, n > 1, then v is said to be a factor of u of length n and the index j is an occurrence of v, 
the empty word e is the factor of length 0. 

By Ln{xi) we denote the set of all factors of u of length n G N, the language of u is then 
the set i(u) = U„gpji„(u). 

Definition 9. Let u be an infinite word over an alphabet A. The function C{n) = 7^i„(u) is 
the factor complexity function of u. We further define the first difference of the complexity 
by AC{n) = C{n + 1) - C{n) . 

In what follows, we shall restrict ourself to those infinite words which are fixed point of 
some substitution (morphism) if defined over a finite alphabet A. We shall further assume 
that (fi is injective and primitive. 

Definition 10. A substitution cp is primitive if there exists k £N such that for all a,b £ A 
the word tp^{a) contains b. 

Equivalently, (p is primitive if the incidence matrix M<^ is primitive. 
There are several well-known properties of the complexity function C. 

Proposition 11. 

(i) For each infinite word u, < C(n) < {^A)"' , 
(a) if u is eventually periodic then C(n) is eventually constant, 

(Hi) u is aperiodic in and only if C{n) is unbounded and C{n) is unbounded if and only if 
AC{n) > 1, for all n G N, 

(iv) if u is a fixed point of a primitive substitution then C(n) is a sublinear function, i.e., 
C{n) < an + b, for some a, 6 G N, 

(v) if u is a fixed point of primitive substitution then AC(n) is bounded. 

Items (i) — (Hi) are obvious, {iv) is due to |23j, (f) was proved in [21j and in a more 
general context in [8j. 

It is also well known that any fixed point of a primitive substitution is uniformly recur- 
rent, i.e., each factor occurs infinitely many times and the gaps between its two consecutive 
occurrences are bounded in length. It implies that each factor is extendable both to the right 
and to the left. 
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Definition 12. Let v be a factor of u, the set of left extensions of v is defined as 

Lext(w) = {a £ A \ av £ i(u)}. 

// #Lext(t') > 2, then v is said to be a left special (LS) factor of u. 

In the analogous way we define the set of right extensions Rext(u) and a right special 
(RS) factor. // v is both left and right special, then it is called bispecial. 

The connection between (left) special factors and the complexity follows from the following 
reasoning. Let us suppose that L„(u) = {vi, . . . , Vk}, k > 1 and let Lext(t;j) = {a^i \ . . . , a^*^}, 
it > l,i = 1, ■ ■ ■ ,k. Now, it is not difficult to realize that 

T / ^ r W (1) (2) (fc-1) (fc) (fc) . 

L„+i(u) = {al 'vi, . . .,a\^vi,a\ 'v2, . . . ,a}^_^'vk^i,a\ 'vk, . . .,aljvk}, 

i.e., by concatenating all factors of length n and all their left extensions we obtain all factors 
of length n + 1 . It implies that 

#Ln+i(u) - #L„(u) =AC(n) = Yl (#Lext(?;) - 1). (2) 

t)eL„(u) 

V is LS 

Hence, if we know all LS factors along with the number of their left extensions, we are able 
to evaluate the complexity C(n) using this formula. 



3.1 Classification of LS factors 

Let a,b £ Lext(f ) be left extensions of a factor v of u, it means both av and bv are factors of 
u. If there exists a letter c G Rext(at;) nRext(6u), we say that v can be extended to the right 
such that it remains LS with left extensions o, 6, indeed a,b € Lext(f;c). 

Definition 13. Let a,b £ Lext(t;) be distinct left extensions of a LS factor v of u. We 
say that v is an (a, 6)-maximal LS factor if Rext(af ) R Rext(6t') = 0, in words, v cannot be 
extended to the right such that it remains LS with left extensions a, b. 



a 



V 



y 



d 



a 

b 
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y 



d 



a) 



b) 



Figure 1: Two types of (a, 6)-maximal LS factor v. 

In general, there are two types of (a, 6)-maximal LS factors both depicted in Figure [l] In 
Case a), a and b are only left extensions of v and so v cannot be extended to the right and 
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remain LS. In Case b), v can be prolonged by letter e such that ve is still a LS factor but it 
looses its left extension a. 

It can also happen that a factor v with left extensions a and b is extendable to the right 
infinitely many times. In this way we obtain an infinite LS branch. 

Definition 14. An infinite word w is an infinite LS branch of u if each prefix of w is a LS 
factor of u. We put 

Lext(w) = Lext(u). 

V prefix of w 

Proposition 15. 

(i) If u is eventually periodic, then there is no infinite LS branch of u, 

(a) if u is aperiodic, then there exists at least one infinite LS branch of u, 

(Hi) if u is a fixed point of a primitive substitution then the number of infinite LS branches 
is bounded. 

(i) is obvious, (Hi) is a direct consequence of ^ and Proposition 11 (v). Item (ii) is a 
direct consequence of the famous Konig's infinity lemma [I^ applied on sets Vi, V2, • • •, where 
the set Vfc comprises all LS factors of length k and where f 1 E is connected by an edge with 
V2 £ Vi+i if ^1 is prefix of V2- 

Taking all together, our aim is to find all (a, 6)-maximal LS factors and also all infinite 
LS branches of u. 

Remark 16. The term "special factor" (for us it was RS factor) was introduced in 1980 |^ 
and it has been used for computing the factor complexity since then (eg. JBI, fH^)- The 
notations introduced above are based on Cassaigne's article f^. An {a, b) -maximal factor is 
a new term, actually it is a special case of a weak bispecial factor proposed there. It is also 
shown in the article that bispecial factors determine the second difference of the complexity in 
a similar way as LS factors determine the first difference of the complexity. 

Remark 17. Everything what has been (and will be) defined or showed for LS factors can be 
defined or showed similarly for RS factors. 



3.2 How to find infinite LS branches 

Before introducing a new notion, let us consider the example substitution 

if-.l^ 1211,2 311,3 2412,4 435,5 534 (3) 

with u = 99°°(1). Further, let t;; be a LS factor (or infinite LS branch) of u with left extensions 
1 and 2. Is ip{w) again LS factor? From Figure [2] (first line) we see that it is not since the 
letter 1 is its only left extension. In order to obtain a LS factor, we have to prepend the 
factor 11 which is the longest common suffix of (p{l) = 1211 and f(2) = 311, then llip{w) is 
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/^(l, 2) = 11 



1 (y9-image 1211. 

>w ► >a?(w) 

2 / 311 / 



/-image 



^llc^(w) 



;^(1, 2) = {2, 3} 



/Z.(2, 3) = e 



311 



24 



\¥j(w) 
12 / 



/-image 



;^(2, 3) = {1, 2} 



Figure 2: Images of LS factors. 

a LS factor with left extensions 2 and 3. In the case when Lext(?i;) = {2,3} (second line in 
Figure [2|, ip{w) is a LS factor since the longest common suffix of ^p{2) = 311 and 93(3) = 2412 
is the empty word e. 

Definition 18. Let ip be a substitution defined over an alphabet A. For each couple of distinct 
letters a,b £ A we define fL{cL, b) as the longest common suffix of words ip{a) and ip{b). 

Definition 19. Let v be a prefix of a word w, then v^^w is the word w without the prefix v. 
Analogously, we define wv~^, if v is a suffix of w. 

Definition 20. Let ip be an injective substitution defined over an alphabet A having a fixed 
point u. For each unordered couple of distinct letters a,b £ A such that Rext(a) nRext(6) 7^ 
we define the set giicL, b) as follows. 

(i) If fL{o,,b) is a proper suffix of both p{a) and (p{b), then gL(a,b) contains just the last 
letters of factors (p{a){fL{a,b))~^ and p{b){fL{a,b))~^ . 

(a) If fL{a,b) = (p{a) (i.e., W.L.O.G. \p{a)\ < \ip{b)\), then gL{a,b) contains the last letter 
of the factor ip{b){fL{a,b))~^ and all the last letters of factors ip{c), where c £ Lext(a) 
such that Rext(ca) n Rext(6) / 0. 

Assumption 21. A substitution (p defined over A is injective and it has a fixed point u such 
that for all a,b £ A, for which gL{a, b) is defined, it holds that i^giia, b) = 2. 

Moreover, if fiicL, b) = ip{a) and d is the last letter of the factor ip{b){fL{a, b))^^ , then for 
all c £ Lext(a) such that Rext(ca) H Rext(6) it holds that d is not the last letter of (p{c). 



Assumption 21 is valid for all suffix-free substitutions since gL{a,,b) from point (i) of 



Definition 20 contains always just two elements and the case when fL{o.jb) = >p{a) never 



happens. If /L(a, b) = <p{a), then Assumption 21 says that if u is a LS factor with Lext(f ) 



{a, 6}, then the last letter e of 'p{c) is the same for all c £ Lext(at') and, moreover, eip{a) is 
not a suffix of <p{b) - in other words, for each LS factor v the factor /^(a, b)ip{v) is again LS. 
We will see that this complicated assumption is satisfied for the (not suffix-free) substitution 
ipj3, where /3 is a non-simple Parry number. 
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Figure 3: The graph GL^ for the Substitution ([s]). 



Definition 22. Let ip be a substitution satisfying Assumption 21 Then for each LS factor 
(or infinite LS branch) w having distinct left extensions a and b we define /-image of w as 
the factor b)ip{w). 



With respect to the preceding discussion, Assumption 21 says that /-image is always LS 
factor and it has just two left extensions, namely two elements of gL{a,b), corresponding to 
two original left extensions a and b. 



Assumption 21 along with the notation introduced above allow us to define the following 
graph. 



Definition 23. Let ip be a substitution defined over an alphabet A satisfying Assumption 21 
We define a directed labelled graph GLi^as follows: 

(i) vertices of GL^ are couples of distinct letters a, b such that Rext(a) Pi Rext(6) ^ 0, 

(a) if gLicL,b) = {c,d}, then there is an edge from a vertex (a, 6) to a vertex {c,d) labelled 
by fL{a,b). 



In fact, the crucial result of Assumption 21 is that out-degree of each vertex is exactly one. 



The graph GL^p for our example substitution is drawn in Figure |3j this substitution satisfies 



Assumption 21 for it is suffix-free. 

Now, let us consider the case when w is an infinite LS branch with a,b £ Lext(w), a ^ b. 
Obviously, /-image of w is uniquely given. For most substitutions even a "/-preimage" of 
each infinite LS branch exists. 

Assumption 24. An infinite word u is a fixed point of a substitution ip satisfying Assump- 



tion 21 For each infinite LS branch w of u with a,b £ Lext(w),a 7^ h there exists at least 
one infinite LS branch w with left extensions c and d such that f -image of w equals w and 
gL{c,d) = {a,b}. 
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This assumption is very weak. Actually, we have not found any primitive substitution not 
satisfying it. The reason for it is the following. It is not difficult to prove (but it requires a 



lot of new notation) this: Disruption of Assumption 24 implies that each factor of u can be 



decomposed into images of letters a G A, in at least two different ways. Thus, in order to 



prove that Assumption 24 is satisfied, it suffices to find any factor with unique decomposition 
to images of letters. For the example substitution ([s]) the factor 1211 is such a factor since it 
can arise only as (p{l), an example of a factor having two different decomposition is 1112, it 
is part of both (p(l)ip(l) and (p{3)ip{l). In the case of ip/^, we will prove that Assumption 24 
is satisfied differently. 

Theorem 25. Let u be a fixed point of a primitive injective substitution ip satisfying Assump- 



tion 24 and let w be an infinite LS branch with a,b & Lext(w),a 7^ b. Then either w is a 
periodic point of ip, i.e, 

w = 93^(w) for some ^ > 1, (4) 

and (a, b) is a vertex of a cycle in GL^p labelled by e only or-w = SLp\s)ipP'^{s) ■ ■ ■ is the unique 
solution of the equation 

w = sip^{w), (5) 

where (a, b) is a vertex of a cycle in GL^ containing at least one edge with non-empty label, 
i is the length of this cycle and 

s = fUgtHa, b))--- ^'-^fUgUa, b))^'~HfL{a, b)). (6) 

Proof Due to Assumption |24[ both the /-image and the /-preimage of w exist. The /-image 



is unique due to Assumption 21 and the uniqueness of /-preimage follows from the fact that 
the number of infinite LS branches is finite. Thus, /-image is one-to-one mapping on the 
finite set of all ordered couples 

{((c,d),w)}, 

where w is an infinite LS branch of u and (c, d) is an unordered couple of letters such that 
c,d£ Lext(w)), c 7^ d. The /-image can be viewed as a permutation on this finite set and so 
it decomposes the set to independent cycles as depicted in Figure |4j £ is then the length of 
the cycle containing ((a, 6),w). 

As explained earlier, applying /-image on a LS factor having left extensions a, b corre- 
sponds to the movement along the edge in GL^ which leads from (a, b). If the labels of the 
edges of the cycle are all e, then the /-image coincides with ip and so we obtain the periodic 
point Q. If at least one edge is labelled by non-empty word, then i is also length of the cycle 
in GL^ containing the vertex (a, b) and Equation ^ then corresponds to £-times applying of 
/-image on w. 

□ 



Our example substitution ip (see (|3|) has five periodic points 

cp^il),ip^i4),ip^{5),{ip'r{2),{ip'ri3). 
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/-image 



/-image 

/-image 

Figure 4: Circular structure of infinite LS branches. 

It is an easy exercise to show that 

Lext(l) = {l,2,3,4,5},Lext(2) = {1, 4, 5}, Lext(3) = {1,4,5}, 

Lext(4) = {l,2,3},Lext(5) = {1,2,3}. 

Looking at the graph GL^p depicted in Figure [s] we see that (/?°°(4), (/?°°(5) are not infinite LS 
branches as none of the vertices (1,2), (2, 3) and (1, 3) is a vertex of a cycle labelled by e only. 
Hence, only <^°*^(1), (99^)°^(2), (93^)°°(3) are infinite LS branches with left extensions 1,4,5. 

As for infinite LS branches corresponding to Equation (|5]), in the case of our example, 
there is only one cycle not labelled by the empty word only between vertices (1,2) and (2,3). 
There are two (= the length of the cycle) equations corresponding to this cycle 

w = 99(ll)99^(w) and w = ll(/?^(w). 

They give us two infinite LS branches 

<^(ll)(^3(il)^5(il)... ^ 
lV(ll)V9^11)..., 

the former having left extensions 1 and 2 and the latter 2 and 3. 



Remark 26. Assumption 21 can he reformulated into a weaker form hut to do so, it would 



require to introduce rather complicated notation. The important fact is that the canonical 



suhstitution ipp satisfies Assumption 21 



4 Infinite LS branches of u 



At first, let us recall known results for simple Parry numbers. The substitution ipj^ from 
Definition |5] is suffix- free and it implies that it satisfies Assumption 21 One can easily prove 



that even Assumption 24 is satisfied. As mentioned earlier, the last letters of images of letters 
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are all distinct and so fh^o-ib) = e for all couples a,h ^ A. The graph GL^^ then looks as 
in Figure [5j It contains m — 1 cycles labelled by e only and hence the only candidate for 
being an infinite LS branch is the unique fixed (and periodic) point of ipp, namely with 
Lext(u^) = A. The same result is proved in [15] using different techniques. 



= 1 , . . . , m — 1 






1 


e 


m — 


fc- 1 


^ 


k + 1 




m — 


1 






e 




m — k 








Figure 5: GL^^ for simple Parry /3. 



Theorem 27 (Il],[l5]). Let (5 > 1 he a simple Parry number with (i^(l) = ti • • • tm (ind let 
Up he the fixed point of the canonical substitution ipjj Then 

(i) if ti = t2 = • • • = tm-i or ti > max{t2, • • • , tm-i}, the exact value of C{n) is 
known [TSf . 

(a) in particular, {m — l)n + 1 < C(n) < mn, for all n>\, 
(Hi) C{n) is affine 44> the following two conditions are fulfilled 

1) tm = 1 

2) for all i = 2,3, ... ,m — l we have 

titi+l ■ ■ ■ tm-iti ■ ■ ■ ti-l :< tit2---tm-i- 

Then C{n) = {m — \)n + 1. 

In this paper, we will find the necessary and sufficient condition for the complexity being 
affine in the case of non-simple Parry numbers. We will see that it is more restrictive than 
the one from point {Hi). 



4.1 Infinite LS branches in case of non-simple Parry numbers 

In this section, we will apply hitherto introduced theory on the fixed point of the substi- 
tution ^pp, where /? is a non-simple Parry number. To be able to do so, we need some more 
notation and simple but useful technical lemmas. 

Definition 28. For all k,£ eN, we define the addition (B : N x N —>■ A as follows. 



k(Bi:= 




' if k + i < m + p, 

(k + i — m mod p) otherwise. 
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Similarly, if used with parameters ti, we define for all k,i & N,k + i > 



tk+e ifO<k + i<m + p + l, 

t^+i+(k+£-m~i mod p) otherwise. 



In fact, the addition © tracks the last letters of the words (/7^(0), n = 0, 1, . . .. Therefore, 
we can rewrite the definition of the substitution (/? in a simpler form 

<^^(A;) =0*'=+i(fc©l), yk£A. 

Further, employing the new notation and the definition of the substitution 99^, one can easily 
prove the following simple observations. 

Lemma 29. For the substitution ipp it holds 
(i) for all n £N and for all k £ A 

^^(A;) = (^;^-i(0))*'=®i('^r^(°))*'®' ■ ■ ■ (v2/3(0))*'=®("-i'0*'=®"(fc © n), 

(a) if avb is a factor 0/ u^, v € .4* and a,b ^ 0, then there exists unique factor v' such 
that '^(^{v') = vb. 

Our aim is to obtain the graph GL^^, thus, we need to know left extensions of letters and 
also all gL{ci, b). 

Definition 30. Let us define for all k £ A, k ^ 0, a function z : {1, . . . ,m + p — 1} 
{0,l,...,m + p-2} by 

z{k) = max{j G N | O-' is a suffix of tit2 • • • tfc}. 

For k £ {m, . . . , m + p— 1} we also define a function y : {m, . . . , m + p — 1} — > {0, 1, . . . ,p—l} 
by 

max{j e N I O-'' is a suffix of tm+itm+2 ■ ■ ■ tm+ptm+i ■■■tk} if k > m, 
max{j G N I O-' is a suffix of tm+itm+2 ■ ■ ■ tm+p} if k = m. 

Further, we define 

, jo ifti>l, 
[ 1 + maxjj G N I O-' is a prefix of t2t3 ■ ■ ■ tm} otherwise 

and finally we put t = mm{tm, tm+p}- 

Note that z{k) and y{k) can return the same value for k > m, & necessary condition for 
z{k) 7^ y{k) is that t = and z{£) 7^ y{i) for all m < I < k. Due to Parry condition ([T| we 
must have l<£o<m — las the case dp{l) = 10 • • • 0{tm+i • • • tm+p-il)^ is not admissible. 

Lemma 31. For the fixed point of ipp it holds 



y{k) 
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(i) Lext(O) = {£o, ...,m + p-l}, 

(ii) Lext(A;) = {z{k)], for k e {2,3, . . . ,m - 1}, 

(in) Lext(A;) = {z(k),y(k)}, for k G {m,m + \ . . . ,m + p — 1}. 

Proof, {a) Each letter A: > can appear in as the image of /c — 1, namely (^/^(A; — 1) = 0*'=/?. 
litk > 0, then G Lext(A:), if tfc = we consider ipj^{k-2) = (/?^(0*'=-i)A; = {^{if'^-'^k. Again, 
if tfc-i > 0, then 1 G Lext(fc), otherwise we continue in the same way. Since ti > 0, this 
process is finite. 

{in) The letter m can appear in not only as an image (p/3{m — 1) (i.e., case {ii)) but 
as well as ^ppim + p — 1) = 0*™+Pm. If we realize this second possible origin of the letters 
m,m + 1, . . . ,m + p — 1, then the proof is the same as for (ii). 

(i) If ti > 1, then 00 is a factor of u. Hence, for all n G N the word 93^^(0)0 = • • • (0 © n)0 
is a factor as well and so Lext(O) = A. 

Let ti = 1, it implies U G {0, 1} for i = 1, . . . , m + p. It holds (/?^°(01) = ipj3{{(.Q - 1)4) = 
^0(4 + 1), hence, + 1, . . . ,m + p — 1 G Lext(O). But dp(l) cannot contain a sequence 

of consecutive O's shorter than £o due to Parry condition ([T| and so £o is the least letter in 
Lext(O). 

□ 

The previous lemma allows us to partially determine all LS factors of u^. 

Corollary 32. Let v he a LS factor of up containing at least one nonzero letter, then one of 
the following factors is a prefix of v. 

(i) 0*11, 

(ii) 0*m, 

(Hi) O'^A;, if k > m and t = tm+i = tm+2 = • • • = t^-i = 0. 

Note, that the factors from the last point are images of the factor 0*m in the case when 
t = 0. 

Lemma 33. For the fixed point of (pjs it holds 

(i) if {k,t) is unordered couple of distinct letters of A such t/iat Rext(A;) nRext(£) ^ 0, and 
{k,£) ^ {m-l,m + p- 1), then fL{k,l) = e and gL{k,i) = {k®l,£® 1}, 

(ii) fbijn — l,m + p — 1) = 0*m and giijn — 1, m + p — 1) = {0, z}, where 

fl + 2;(m-l) iftm<tm+p, 
z = < (7) 
[l + z{m + p-l) iftm+p 

Proof (i) follows directly from the definitions of gi, fi and (pp. {ii) is a simple consequence 



of Lemma 31 Remark that if > tm+p ^ 0, then z{m + p — 1) = y{m + p — 1). 

□ 
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Now we have the knowledge necessary to complete the graph GLip„ but still we have to 



prove that the substitution ipp satisfies Assumptions ^T] and 24 



Lemma 34. The substitution ipp from Definition^ satisfies Assumptions 21 and 24 



Proof. The fact that Assumptions 21 is fulfilled follows from Lemmas 31 and 33 



To construct an /-preimage for an arbitrary infinite LS branch is easy due to Lemma 29 
part (m), Corollary 32 and Lemma 33 

□ 

Now, we know all we need to be able to construct the graph GL^p^. For the case when 
ti > 1, the graph is depicted in Figure [6} Since Lext(O) = A, all possible unordered couples 
of letters are vertices of the graph. If z is not a multiple of p (i.e., the decision condition 
z = sp in Figure [g] returns no), then the graph contains only cycles with edges labelled by e 
only. li z = sp for certain positive integer s, then there is the cycle on vertices (0, z), (1, z © 
1), . . . , (m — 1, z © m — 1), where the edge from the vertex (m — 1, z © m — 1) to the vertex 
(7i(m — 1, z © m — 1) = (0, z) is labelled by /L(^T^ — 1, z © m — 1) = 0*m. 

If ti = 1 the graph GL^^ is the same as in Figure |6] but we have to remove vertices (/c, i), 
where k < io or i < io and (fe, i) ^ (0 © n, z © n) for any n £ N. What is important for our 
purpose is that the structure of cycles is the same for arbitrary value of ti. 

k ^ sp 



1 


e 




m 


e 




m + p— 1 
k ® m + p — 1 


fcffi 1 




k®m 







Figure 6: GL^^ for non-simple Parry /3, s is a positive integer. 

Since the fact whether z is or is not a multiple of p is crucial for the structure of cycles in 
GL^p^, we introduce the following set. 

Definition 35. A non-simple Parry number f3 > 1 is an element of a set S if and only if 
there exists a positive integer s such that z = sp, where z is the non-zero left extension of 
0*m. 



Employing Lemmas 31 and 33 one can easily prove the following. 
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Lemma 36. A non-simple Parry number /3 > 1 belongs to S if and only if one of the following 
conditions is satisfied 

a) df3{l) = ti • • • tm{0 ■ ■ ■ Otm+p)'^ and tm > tm+p, 

b) df3{l) =ti- ■ -tm-qpO—^tmitm + 1 • • ■ tm+p)^ , <? > 1, and tm < tm+p- 

Putting it all together, we obtain a proof of the following proposition which gives us the 
complete list of infinite LS branches of for all non-simple Parry numbers. 

Proposition 37. Let (3 > 1 be a non-simple Parry number and let be the fixed point of 
the canonical substitution ipp. Then 

(i) if p > 1, then is an infinite LS branch with left extensions {m,m + l, . . . ,m-\-p — l}, 

(a) if (3 ^ S, then is the unique infinite LS branch, 

(Hi) if P £ S, then there are m infinite LS branches 

There are no other infinite LS branches ofu^. 

5 Maximal LS factors 

As explained earlier, in order to determine the complexity of an infinite word, we need to 
find all infinite LS branches as well as all (a, 6)-maximal LS factors. The structure of (a, b)- 
maximal LS factors is not so simple as the one of infinite LS branches but still it can be 
described using the notion of /-image. To define an /-image for (a, 6)-maximal LS factors. 



we need Assumption 21 to be satisfied also for gji - we will say that the right version of 



Assumption |2 1 1 is satisfied. 

Lemma 38. For the substitution Lpj3 and for all distinct a,b £ A we have fR{a,b) = O*"'*", 
where 

*a,b = min{ta,tb}. (8) 



Thus, the right version of Assumption 21 is satisfied for (fjj is prefix- free. 



Definition 39. A factor v G A'^ is an {a — c,b — d)-bispecial factor of an infinite word u 
defined over a finite alphabet A if both avc and bvd are factors of u. 

Definition 40. Let a substitution ip defined over a finite alphabet A satisfy the left and right 



version of Assumption 21 and let v be an {a — c,b — d)-bispecial factor of a fixed point of (p. 



Then fL{a,b)(f{v)fii{c,d) is said to be the /-image of v. 



16 



Obviously, the /-image of w is (a — c, 6 — d)-bispecial, where 5L(a, b) = {a, 6} and gR{c, d) = 
{c,d}. 

Now, consider again the particular case of u^. A LS factor v having a,b G Lext(?;) is 
(a, 6)-maximal if Rext(au) n Rext(6t') = and so it is as well an {a — c,b — d)-bispecial for all 
c G Rext(af ) and d £ Rext(6f ). Are /-images of v again {giicL, 6))-maximal? Not all of them 
as states the following simple lemma. 

Lemma 41. Let v be a bispecial factor o/ having left extensions a and b. If its f -image 

fLia,b)M^)fRic,d) = fLia,b)ipp{v)0'^^^-'^^\ 

is {gL{a,b)) -maximal, then c £ Rext{av),d £ Rext(6t') satisfy 

tcei > max{te©ij©i | e G Rext(at;), / G Rext(6f )} 
td®i > max{te©ij©i I e G Rext(67;),/ G Rext(6u)}. 

Definition 42. An f -image of a bispecial factor v having left extensions a and b 

fL{a,b)ipf^{v)fR{c, d), 

where c G Rext(av),d G Rext(6v) satisfy ([9|, is said to be the max- /-image of v. 

The following lemma is crucial for understanding the structure of the max- /-images of 
(a, 6)-maximal factors. 

Lemma 43. If i,k £ A, £ ^ k and t£©it£©2 • • • ^ ifceiifce2 " " " ; then for all n £ N the longest 
common prefix of the factors ^^{k) and ^p'^{i), denoted by lcp{if^{k), ip^{£)), equals 

icp{^-^{k),^^i)) = ^^k)ikenr\ 

i.e., ^^{k) without the last letter k (B n. 

Moreover, denote by c the letter such that {lcp{(f'p{k),ip'p{£)))c is a prefix of ip'^{i). Then, 
tc®itcm ■■■ h ifce(n+i)*fce(n+2) • • • for all n G N. 

Proof. The case n = is trivial. 

The rest of the proof is carried on by induction on n. 

v^r^fc) = (v^^(o))*'=ev^(A:ei), 

v^r'W = (9'^(o))*'=®nv'^(o))*^®^-*'=®V^(^ei), (10) 

if ^£©1 = ffc©i, we apply the assumption of induction on lcp((^^(A; © 1), © 1)) and if 
te(si > ifcei) then on lcp{ip'^{k © 1), ^^{0)) (see Parry condition ([T|). 

As for the second part of the statement, the letter c is given by ( |10[ ) and this along with 
the Parry condition concludes the proof. 

□ 
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Lemma 44. Let n G N. The n-th max- f -image of a bispecial factor v with left extensions a 
and b, i.e., the factor we obtain if we apply n times the mapping max-f -image on v, equals 



V = s^''p{v)\cv{^]i{c),^]i{d)), 
where c G Rext(at;), d G Rext(6r;), s is given by (cf. ^) 

s = fL{gr\a,b))--.^^-\fUgLia,b))ip^-\fL{a,b)). (11) 

and 

tc®itc®2 • • • h ic'ei*c'e2 ■ ■ ■ ) 

td®ltd®2 • • • h td'@ltd%2 • • • 

for all c' G Rext(at') and d' G Rext(&w). 

Proof. The case n = is obvious, we carry on by induction on n. Let us assume W.L.O.G. 
that 

ic®l'tc®2 • • • ^ id®l'td®2 ■ ■ ■ 

and that g2{a,b) = {a,b}. Hence 

u = sv9^(v)(^^(d)((ien)-^ 

and 

Rext(6?;) = {d' ® n \ td'^i ■ ■ ■ td'®n = tdm ■ ■ ■ tden}- 



Further, let c' G Rext(ai'), then due to Lemma 43 



*c'el*c'e2 • • • ^ *d'©(n+l)*d'e(n+2) ' ' ' 

for all d' ® n £ Rext(5t)). But tdq)(^n+i) ^ *d'©(n+i) for d' (B n £ Rext(6u) and so the 
max- /-image of v equals 

/i(52(a,6))v^/3(t;)0*^e(.+i) =/^(g£(a,5))v.^(s)(^^+l(^)lcp(^^+l(c),(^;+^d))• 
□ 

Each bispecial factor v having left extensions a and b has the unique max- /-image. Since 
the substitution ipp is injective, the structure of max- /-images cannot be circular as it is for 
/-images of infinite LS branches - v cannot be the fe-th max- /-image of its own for any k. 
However, the notion of a max- /-image allows us to describe all (o, 6)-maximal factors of for 
all a,b £ A. We will prove that each (o, 6)-maximal factor is the A;-th max- /-image either of 
0*1-1 if > 1 or of if ti = 1, for some G N. A sketch of the proof is as follows. Let v be an 



(a, 6)-maximal factor containing at least two nonzero letters. Employing Lemma 29 part (ii), 
one can find a bispecial factor v such that its max- /-image is v. Again, if v contains at least 
two nonzero letters, we find a bispecial factor v such that its max- /-image is v. In this way, we 
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obtain a bispecial factor containing at most one nonzero letter such that its k-ih max- /-image 



equals v. According to Corollary 32 the only candidates for such bispecials are of the form 0* 
or O^mO'^, where 1 < s < ti and < q < ti. Note that 0*^^^ cannot be a factor of and that 
is why we consider s,q < ti. In the case when t = 0, 0*'=A;0'^, k > m,tm+i = • • • = tk-i = 0, 
could also be taken as the candidates but we do not consider them as they are just prefixes 
of (^^^"(mC). The following two lemmas tell us that 0*^~^ (resp. if ti = 1) is the only 
candidate. 

Lemma 45. Let ti > 1 and k £N. Then the k-th max-f -image of factors 0*^ 0* and 0*mO'', 
where 1 < s < ti — 1 and < q < ti, are not (a, b) -maximal for any distinct letters a and b. 

Proof. First, consider 0*^ with distinct left extensions a and b. It holds that Lext(0*^) = 
Lext(0*il) and Rext(0*i) C {k £ A \ {0} \ tk = ti}. For each k G Rext(0*i), we must have 



ifcei*fce2 • • • -< ^2^3 • • • (see Parry condition ([T|) and, due to Lemma 44 k-th max- /-image 
of 0*^ is a prefix of the k-th. /-image of the LS factor (/?^(0*^1), both having the same left 
extensions. 

Similar arguments can be used in order to prove that k-th max- /-image of 0* is always a 
prefix of fc-th /-image of the LS factor 0*^"^. Again, Lext(O^) = Lext(0*i~^) and the rest is 
implied directly by the Parry condition. 

Finally, consider the LS factor 0*rn-0^ having just two left extensions and z (see ([7])). In 



accord with Lemma 44 the m-th max- /-image of 0*^ ^ with left extensions and p equals 

0*m(/5^(0*i-^)(^;^(l)(m + 1)-^ = 0*mO*il • • • . (12) 

Indeed, Rext(00*i-i) = {k £ A \ {0}\tk = ti} and £ Rext(pO*i-i) and so the fact 
that ifc©itfc©2 ■ ■ ■ ~< ^2^3 ■ ■ ■ and the Parry condition imply that the m-th max- /-image is 



0*m</7j^(0*i^i)lcp((/?J^(0),(/?^(l)). Thus, 0*mO«, as a prefix of is not a (0, z)-maximal. 

□ 

Lemma 46. Let ti = 1 and /c E N. Then t = and the k-th max-f -image of the factor mC, 
where < q < I, is not (a, b) -maximal for any distinct letters a and b. 

Proof. As in the proof of the previous lemma, we can prove that the (m — lo)-th max- /-image 
of with left extensions Iq and Iq -\- p is the factor 

m^^{l){m + l)-\ (13) 

where, according to Lemma [29] part {i), 

^^{1) = {^J-Ho)y-{^^-\o)Y^ ■ ■ • ((^^(o))*-o*-+Hm + 1). 

In order that mO may be (0, 2;)-maximal, it must be 9?^(1) = m -\- 1 and so t2 = ■ ■ " = 
tm+i = 0. But is is not possible due to the Parry condition since then tit2 -< {tm+p = 

l)^m ■ ■ ■ im+pt{m+p)lSl ' ' ' ■ 

□ 
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Proposition 47. Let v be an (a, b) -maximal factor of up. Then there exists A; G N such that 
V is the k-th max-f -image of 



(i) 0*1-1 ifti>l, 

(ii) ifti = 1. 

Proof. We will prove that if v contains at least two nonzero letters, then it is the A;-th max- 
/-image of a bispecial factor of the form 0* or 0*mO'', where 1 < s < ti and < g < ti. The 
rest of the proof then follows from the previous two lemmas. 



Let us assume that v contains at least two nonzero letters. Then, due to Lemma 29 part 
(ii), V = fi{a',b')ipf3{v)fR{c',d'), where ij is a (a' — c',b' — d')-bispecial factor such that v is 
the max- /-image of v and gLia' , b') = {a, b}. Analogously, if v contains at least two nonzero 
letters, there exists an (a" — c" , b" — (i")-bispecial factor v which is an /-preimage of v. But it 
must be also a max- /-preimage, if it is not, then vO'^' is also the /-image of v having the left 
extensions a' and b' for some q' > and so v cannot be (a, 6)-maximal as it is a proper prefix 
of the max- /-image of LS factor vO'^' with the left extensions a and b. Using this argument 
iteratively, we will obtain a bispecial factor of the form 0* or 0*m-0^ such that v is its A:-th 
max- /-image. 

□ 



In fact, the previous proposition along with Lemma 44 provides us with the complete list 
of (a, 6)-maximal factors. However, in the last section of this paper we will need to know some 
details to be able to determine under which conditions the complexity of is affine. 

Corollary 48. // 7^ ti(0 • • • 0(ti — 1))'^, then the k-th max-f -image of the factor (12) 

is (5^(0, z))-maximal for all A; G N. 

If (3 ^ S, then the k-th max-f -image reads 



Proof. The factor (12) is always LS with just two left extensions and z. Therefore it is 
(0, 2;)-maximal if it is neither a prefix of any infinite LS branch or a proper prefix of the k-ih 
max- /-image of its own for certain A; > 0. 

In the case when (3 ^ S, the longest common prefix of the A;-th max- /-image of the 



factor (12) and of the unique infinite LS branch equals 

Hence, either it is non-empty and shorter than the longest common prefix of the (A; -|- l)-th 



max- /-image of (12) and of or it is empty, k < p and t = tm+i = • • • = tm+k = (or 



only t = for A; = 0). In the latter case, the A;-th max- /-image of (12) begins in letter m-\- k 



which is different from the first letters of U/3 and of all other max- /-images of (12). Putting 



all together, the A;-th max- /-image of (12 ) is neither a prefix of or of the ^-th max- /-image 



of (12) for any i ^ k. 
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If /? G 5, then is not the only one infinite LS branch, there are m other branches 

ui = 0*m(^;^(0*m)(^^'^(0*m) • • • (14) 

and ui = ip^^^(ui),l = 2, . . . , m. To finish the proof, we have to foreclose the possibility 
that the factor (|12|) is prefix of ui. Looking at (14) and (12), we see that it happens only if 



t = ti — 1 and m = 1, in other words, if (i^(l) = ti(0 • • • 0{ti — l))"^. The proof of that the 
factor ( 12 ) is not a prefix of any max- /-image of its own is analogous to the one above. 

□ 

Corollary 49. If ti > 1, then the k-th max-f -image o/0*^~^ with left extensions and a is 
a (^1(0, a))-maximal factor for all a ^ A \ {0, z} and for all < k < m. 
Moreover, put 



(15) 



-1 ift^ti-1, 
if t = ti — 1 and t2 ^ tm+i 

max{£ G N | t^+i / tme^} otherwise, 

then the k-th max-f -image of O''^^^ is also (g'KO, z)) -maximal factor for all k^ < k < m. 

Proof. It holds 

Rext(00*i~^) = {k£A\{0}\tk = tiovk = m and tm+p = ti} 

and for all a G „4 \ {0} we have k G Rext(aO*^~^) if and only if A; = or the both following 
conditions are satisfied: 



(i) z{k) = a — 1 or y{k) = a — 1, 

(ii) tk = ti — 1 or k = m and tm+p 



1. 



The intersection of Rext(00*^~^) and Rext(aO*i~^) is not empty if and only if a = z and 
t = ti — 1, in other words, if and only if 0*^~^ is a prefix of 0*m what is a LS factor having 
just two left extensions and z. 

Similarly, we can prove that the k-th max- /-image of 0*^~^ is (5r^(0, a))-maximal factor 
for all a G .4 \ {0, z}. Also similarly, the A;-th max- /-image of 0*^^^, namely 

ip'^{o'^"')4{i){k + ir\ 

is (5^(0, z))-maximal if it is not prefix of the LS factor 



having the left extensions g\{^^z). The proof then follows from Lemma 29 part (i) and 
applied on v^^(l) and ip'^{m). 

□ 



Lemma 



43 
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Taking into account Lemmas 31 and 47, one can prove the following corollary using analo- 
gous techniques as in the proof of the previous one. Note that Rext(£oO) = {k £ A \ z{k — l) = 
£o — 1 or y{k — 1) = — 1} and Rext(aO) = {1} for all a > Iq, i.e., is {£o,^o + 2;)-maximal 



if it is not a prefix of the Iq-Hi max- /-image of the factor (12) which reads 



(^J«(m)(^™+^o(l)(l e (m + lo))-' = ^'/^{m)^^{l, + 1)(1 © (m + £0))"^ 



Corollary 50. If ti = 1, then the k-th max- f -image of is {§^{1^, a -\- Iq)) -maximal factor 
for all letters a > Iq, a ^ z and for all < k < m — io- 



Moreover, k-th max-f -image of is {gJ^{£Q,z -\- £o))-maximal if ko > io and k = ko 



CO, KQ 



^0 + 1, 



, m 



io, where ko is defined by (15) 



6 Affine complexity 

The aim of the present section is to find the necessary and sufficient condition for the factor 
complexity of being affine. In order the complexity to be affine, the first difference of 
complexity AC(n) must be constant. The following lemma says when AC(n) can change its 
value. The proof is an immediate consequence of ([2]). 

Lemma 51. Let u be an infinite word over a finite alphabet. 

(i) If AC{n -\- 1) >AC(n), then the number of LS factor of length n-\- 1 is greater then the 
number of LS factor of length n. 

(a) If AC{n-\-l) <AC{n), then u contains (a, b)-maximal factor of length n for some letters 
a and b. 

That is, the complexity is affine if either u does not contain any (a, 6)-maximal factor and 
all infinite LS branches have empty common prefix or if each (a, 6)-maximal factor of length 
n is "compensated" by appearing of a "new" LS factor of length n -|- 1 . Examples of the first 
case are Arnoux-Rauzy words whose all LS factors are prefixes of unique infinite LS branch. 
As for the latter case, appearing of a "new" LS factor of length n -|- 1 means there are a LS 
factor V of length n and its right extensions c and d such that vc and vd are both LS, i.e v 
is the longest common prefix of two different LS factors - Cassaigne [9] call such LS factors 
strong bispecial. 

Since comprises always at least one (a, 6)-maximal factor, each such (a, 6)-maximal 
must be as long as the longest common prefix of two different LS factors in order that the 
complexity may be affine. We will prove that it is possible only if the number of (a, 6)-maximal 
factors is finite, thus in the case of dp^l) = ti(0 • • • 0(ti — 1))'^. 

Lemma 52. If ko < m — 1, where ko is defined by ( [Xs] ), then the factor complexity of is 
not affine. 
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Proof. If /co < m — 1, then the {ko + l)-th max- /-image, if ti > 1, (resp. (/cq — + l)-th if 
ti = 1) of 0*^"^ (resp. 0) is 5f^''(0, 2:)-maximal. Consider the longest common prefix of the 
LS factor (/7^''(0*m) having left extensions ^^"(0, z) and of the infinite LS branch u^, if p > 1, 
or of the LS factor f^^^ (0^^^^) with left extensions m — 1 and m, if p = 1 (and so i 
not an infinite LS branch). This factor equals (/9^"(0*m)(m © A;o)~^ which is a prefix of the 
{ko + l)-th max- /-image (resp. (/cq — + l)-th if ti = 1) of 0*^~^ (resp. 0) and hence it is 
not (a, 6) -maximal for any distinct a,b ^ A. Overall, AC(no) <AC(no -|- 1), where uq is the 
length of the factor (^^"(0*m)(m ko)~^. 

□ 

Lemma 53. If dp^l) = ti(0 • • • 0{ti — 1))^ , then the factor complexity of up is affine, namely 
C(n) = pn + l,n 

Proof. In this case, t = ti — 1 and so /cq = = m — 1. Hence, the (0, a)-maximal factor 
0*1 is at the same time the longest common prefix of the only infinite LS branches 
and 0*m(/3^(0*m)(/9^(0*m) • • • . But is the only (a, 6)-maximal and prefixes of these two 

infinite LS branches are the only LS factors of u^, thus, the proof is complete. 

□ 

Lemma 54. If (3 £ S and (i^(l) ^ ti(0 • • • 0(ti — 1))"^, then the factor complexity of up is not 
affine. 



Proof. In the case when p > 1, there are m + 1 infinite LS branches given by Proposition 37 
Let us denote them by uq = u^, ui, . . . , and put 

no = max{|i;| | v = lcp(uj,Uj),i / j,i,j = 0, 1, . . . ,m}. 

We have AC(n) > #Lext(uo) — 1 -|- X^^Li #Lext(ufe) — 1 > p — 1 -|- m for all n > no. Due to 



Corollary 48, we know that there exist infinitely many (5^(0, 2;))-maximal factors. A: = 0, 1, . . ., 
and hence there must exist a LS factor of length rii > tiq which is not a prefix of any LS 
branch and so AC(ni) > m + p — 1 =AC(1). 

In the case of p = 1, the proof is analogous. Only difference is that there are only m 
infinite LS branches since is not. □ 

Remark 55. For the word up with dp{l) = ti(0---0(ti — 1))*^ we may easily describe all 
left special factors. If the length of the period p > 1, each LS factor is a prefix of one of two 
infinite LS branches up and 0~"'^u^. Ifp = 1, then up is not an infinite LS branch and so each 
LS factor is prefix of the unique infinite LS branch 0~^up. Hence, we obtain the known result 
that Up is Sturmian if and only if dp{l) = ti(ti — 1)^ . We were pointed out by Christiane 
Frougny that numbers f3 satisfying dp{l) = fi(0---0(ti — 1))'^ are Pisot units. Such Parry 
number j3 is a root of the polynomial x^^^ — tix^ — a; -|- 1. 

Lemma 56. Let (3 ^ S and let ko > m — 1. Then the factor complexity of up is not affine. 
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Proof. As shown in the proof of Lemma 52 the k-th max- /-image of 0*1-^ (resp. if ti = 1), 
= 0, 1, . . . , m — 1, is not beginning in 0*m and it is equal to the longest prefixes of some two 
LS factors. In order that the complexity is affine, also all max- /-images of the factor (12) 
must be as long as the longest prefixes of some two LS factors. 

Let ti > 1. Then the factor (12 ) must be of the same length as the longest common prefix 
of and m-th max- /-image of its own - remember that the longest common prefix of and 
A:-th max- /-image of ( 12 ) is the /c-th max- /-image of 0*^~^ for A; = 0, 1, . . . , m — 1. Formally, 



= \lcp{up,ip^{0'm)^f'{0'--H){l(B{2m)r')\ = \^^{0'm){m (B m)-'\ 

which is never satisfied for |(^™(0*m)| < |(/9™(0*^~^1)|. 

Let ti = 1. Following the same reasoning as for the case ti > 1, a necessary condition for 
the complexity to be affine is that the factor ( |12[ ) 

must be of the same length as the longest common prefix of the (m — io)-th max- /-image of 
its own and u^, namely 

|lcp(u^, ^o(i)(i ^ (2m - £o))-')\ = |<^J-'°(m)(m (m - £o))-'\ 

which is never satisfied for \ipj'^%m)\ <\ip'^{l)\. □ 

Putting all lemmas of this section together, we obtain the main theorem of this paper. 

Theorem 57. Let (3 be a non-simple Parry number. The factor complexity of is affine if 
and only if d/sil) = ti{0 ■ ■ ■ 0(ti - 1))"^. 

7 Conclusion 

Among infinite words associated with Parry numbers we may identify Arnoux-Rauzy 
words. An infinite word is said to be Arnoux-Rauzy of order if for any length n G N 
there exists exactly one left special factor and one right special factor both of length n and, 
moreover, these special factors have just I left and £ right extensions respectively. Arnoux- 
Rauzy words can be considered as a natural generalization of Sturmian words to more letter 
alphabets. 

Is is easy to see that only Sturmian words among correspond to /3 with dp{l) = til 
or (i/3(l) = — 1)"^. The word is an Arnoux-Rauzy word of order m > 3 if and only 
if (i/3(l) = t™^^l. It means that there is no Arnoux-Rauzy word over more letter alphabet 
associated with non-simple Parry number. A direct consequence of the definition of Arnoux- 
Rauzy words is that the complexity of Arnoux-Rauzy word is affine and that any left (right) 
special factor is a prefix of an infinite left (right) special branch. 
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In the previous section, we have proved that the infinite word associated with a non- 
simple Parry number (3 has the affine complexity if and only if (i/3(l) = ti(0 • • • 0{ti — 1))^. In 
fact, we have proved that the complexity is affine if and only if any left special factor of 
is a prefix of an infinite left special branch. The validity of the same statement for infinite 
words associated with simple Parry numbers is proven in jl] . However, this equivalency is not 
a general rule for the factor complexity of fixed points of primitive morphisms. For a counter 
example see [10] and [13] . 

It is known that Sturmian words have many equivalent definitions, see ^ for more. In 
2001 Vuillon |28j showed that a binary infinite word is Sturmian if and only if each its factor 
has exactly two return words. In the article [2.7' Vuillon introduced the property R^: an 
infinite word satisfies the property Rg if each its factor has exactly £ return words. Therefore, 
words with Ri can be considered as another generalization of Sturmian words. In [T7| Justin 
and Vuillon proved that Arnoux-Rauzy words of order i have the property Rg. Applying 
Theorem 4.5 of [2], we see that all u/^ with affine complexity have also the property Rg. 
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